Predicting Intonational Boundaries Automatically from Text: The ATIS Domain

Authors

Michelle Q. Wang

Julia Hirschberg

Abstract

Relating the intonational characteristics of an utterance to other features inferable from its text is important both for speech recognition and for speech synthesis. This work investigates techniques for predicting the location of intonational phrase boundaries in natural speech, through analyzing utterances from the DARPA Air Travel Information Service database. For statistical modeling, we employ Classification and Regression Tree (CART) techniques. We achieve success rates of just...

Similar resources

Predicting Intonational Phrasing from Text

Determining the relationship between the intonational characteristics of an utterance and other features inferable from its text is important both for speech recognition and for speech synthesis. This work investigates the use of text analysis in predicting the location of intonational phrase boundaries in natural speech, through analyzing 298 utterances from the DARPA Air Travel Information Se...

Automatic Classification of Intonational Phrase Boundaries

The relationship between the intonational characteristics of an utterance and other features inferable from its text represents an important source of information both for speech recognition, to constrain the set of allowable hypotheses, and for speech synthesis, to assign intonational features appropriately from text. This work investigates the usefulness of a number of textual features and ad...

The ATIS Sign Language Corpus

Systems that automatically process sign language rely on appropriate data. We therefore present the ATIS sign language corpus that is based on the domain of air travel information. It is available for five languages, English, German, Irish sign language, German sign language and South African sign language. The corpus can be used for different tasks like automatic statistical translation and au...

Training intonational phrasing rules automatically for English and Spanish text-to-speech

We describe a procedure for acquiring intonational phrasing rules for text-to-speech synthesis automatically, from annotated text, and some evaluation of this procedure for English and Spanish. The procedure employs decision trees generated automatically, using Classification and Regression Tree techniques, from text corpora which have been hand-labeled by native speakers with likely locations o...

A Model for Extracting Information from Text Documents, Based on Text Mining in the E-Learning Domain

As computer networks become the backbones of science and the economy, enormous quantities of documents become available. Text mining techniques have therefore been used to extract useful information from textual data. Text mining has become an important research area that discovers unknown information, facts, or new hypotheses by automatically extracting information from different written documents. T...

Publication date: 1991
